Self-supervised Short-text Modeling through Auxiliary Context Generation
نویسندگان
چکیده
Short text is ambiguous and often relies predominantly on the domain context at hand in order to attain semantic relevance. Existing classification models perform poorly short due data sparsity inadequate context. Auxiliary context, which can provide sufficient background regarding domain, typically available several application scenarios. While some of existing works aim leverage real-world knowledge enhance short-text representations, they fail place appropriate emphasis auxiliary Such do not harness full potential sources. To address this challenge, we reformulate as a dual channel self-supervised learning problem (that leverages context) with generation network corresponding prediction model. We propose framework, Pseudo-Auxiliary Context for Short-text Modeling (PACS) , comprehensively it jointly learned an end-to-end manner. Our PACS model consists two sub-networks: Generation Network (CGN) that context’s distribution Prediction (PN) map features final class label. experimental results diverse datasets demonstrate outperforms formidable state-of-the-art baselines. also performance our cold-start scenarios (where contextual information non-existent) during prediction. Furthermore, interpretability ablation studies analyze various representational captured by individual contribution its modules overall PACS, respectively.
منابع مشابه
Short and Sparse Text Topic Modeling via Self-Aggregation
The overwhelming amount of short text data on social media and elsewhere has posed great challenges to topic modeling due to the sparsity problem. Most existing attempts to alleviate this problem resort to heuristic strategies to aggregate short texts into pseudo-documents before the application of standard topic modeling. Although such strategies cannot be well generalized to more general genr...
متن کاملImprovements to context based self-supervised learning
We develop a set of methods to improve on the results of self-supervised learning using context. We start with a baseline of patch based arrangement context learning and go from there. Our methods address some overt problems such as chromatic aberration as well as other potential problems such as spatial skew and mid-level feature neglect. We prevent problems with testing generalization on comm...
متن کاملSupervised Term Weighting Metrics for Sentiment Analysis in Short Text
Term weighting metrics assign weights to terms in order to discriminate the important terms from the less crucial ones. Due to this characteristic, these metrics have attracted growing attention in text classification and recently in sentiment analysis. Using the weights given by such metrics could lead to more accurate document representation which may improve the performance of the classifica...
متن کاملBayesian Supervised Domain Adaptation for Short Text Similarity
Identification of short text similarity (STS) is a high-utility NLP task with applications in a variety of domains. We explore adaptation of STS algorithms to different target domains and applications. A two-level hierarchical Bayesian model is employed for domain adaptation (DA) of a linear STS model to text from different sources (e.g., news, tweets). This model is then further extended for m...
متن کاملSemi-supervised Clustering for Short Text via Deep Representation Learning
In this work, we propose a semi-supervised method for short text clustering, where we represent texts as distributed vectors with neural networks, and use a small amount of labeled data to specify our intention for clustering. We design a novel objective to combine the representation learning process and the kmeans clustering process together, and optimize the objective with both labeled data a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Intelligent Systems and Technology
سال: 2022
ISSN: ['2157-6904', '2157-6912']
DOI: https://doi.org/10.1145/3511712